Automatic Argumentative-Zoning Using Word2vec

نویسنده

  • Haixia Liu
چکیده

In comparison with document summarization on the articles from social media and newswire, argumentative zoning (AZ) is an important task in scientific paper analysis. Traditional methodology to carry on this task relies on feature engineering from different levels. In this paper, three models of generating sentence vectors for the task of sentence classification were explored and compared. The proposed approach builds sentence representations using learned embeddings based on neural network. The learned word embeddings formed a feature space, to which the examined sentence is mapped to. Those features are input into the classifiers for supervised classification. Using 10-cross-validation scheme, evaluation was conducted on the Argumentative-Zoning (AZ) annotated articles. The results showed that simply averaging the word vectors in a sentence works better than the paragraph to vector algorithm and by integrating specific cuewords into the loss function of the neural network can improve the classification performance. In comparison with the hand-crafted features, the word2vec method won for most of the categories. However, the hand-crafted features showed their strength on classifying some of the categories.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Critiquing of Novices’ Scientific Writing Using Argumentative Zoning

Scientific writing can be hard for novice writers, even in their own language. We present a system that applies Argumentative Zoning (AZ) (Teufel & Moens 2002), a method of determining argumentative structure in texts, to the task of advising novice writers on their writing. We address this task by automatically determining the rhetorical/argumentative status and the implicit author stance of a...

متن کامل

Accurate Argumentative Zoning with Maximum Entropy models

We present a maximum entropy classifier that significantly improves the accuracy of Argumentative Zoning in scientific literature. We examine the features used to achieve this result and experiment with Argumentative Zoning as a sequence tagging task, decoded with Viterbi using up to four previous classification decisions. The result is a 23% F-score increase on the Computational Linguistics co...

متن کامل

CoZo+ - A Content Zoning Engine for textual documents

Content zoning can be understood as a segmentation of textual documents into zones. This is inspired by [6] who initially proposed an approach for the argumentative zoning of textual documents. With the prototypical Cozo+ engine, we focus on content zoning towards an automatic processing of textual streams while considering only the actors as the zones. We gain information that can be used to r...

متن کامل

ArgMine: A Framework for Argumentation Mining

The aim of argumentation mining is the automatic detection and identification of the argumentative structure contained within a piece of natural language text. In this paper we present the ArgMine Framework: an alignment of tools and processes that facilitate and partially automate argumentation mining research. We also report on a preliminary exploitation of the framework, where we address arg...

متن کامل

A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents

Argumentative Zoning (AZ) – analysis of the argumentative structure of a scientific paper – has proved useful for a number of information access tasks. Current approaches to AZ rely on supervised machine learning (ML). Requiring large amounts of annotated data, these approaches are expensive to develop and port to different domains and tasks. A potential solution to this problem is to use weakl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1703.10152  شماره 

صفحات  -

تاریخ انتشار 2017